On the Effect of Semantically Enriched Context Models on Software Modularization

نویسندگان

  • Amir Saeidi
  • Jurriaan Hage
  • Ravi Khadka
  • Slinger Jansen
چکیده

Many of the existing approaches for program comprehension rely on the linguistic information found in source code, such as identifier names and comments. Semantic clustering is one such technique for modularization of the system that relies on the informal semantics of the program, encoded in the vocabulary used in the source code. Treating the source code as a collection of tokens loses the semantic information embedded within the identifiers. We try to overcome this problem by introducing context models for source code identifiers to obtain a semantic kernel, which can be used for both deriving the topics that run through the system as well as their clustering. In the first model, we abstract an identifier to its type representation and build on this notion of context to construct a contextual vector representation of the source code. The second notion of context is defined based on the flow of data between identifiers to represent a module as a dependency graph where the nodes correspond to identifiers and the edges represent the data dependencies between identifiers. We have applied our approach to 10 medium-sized Java projects, and show that by introducing contexts for identifiers, the quality of the modularization of the software systems is improved. Both of the context models give results that are superior to the plain vector representation of documents. In some cases, the authoritativeness of decompositions is improved by 66%. Furthermore, a more detailed evaluation of our approach on JEdit, an open source editor, demonstrates that inferred topics through performing topic analysis on the contextual representations are more meaningful compared to the plain representation of the documents. The proposed approach in introducing a context model paves the way for building tools that support developers in program comprehension tasks such as domain concept location and topic analysis.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

First Language Activation during Second Language Lexical Processing in a Sentential Context

 Lexicalization-patterns, the way words are mapped onto concepts, differ from one language      to another. This study investigated the influence of first language (L1) lexicalization patterns on the processing of second language (L2) words in sentential contexts by both less proficient and more proficient Persian learners of English. The focus was on cases where two different senses of a polys...

متن کامل

The Effect of Teaching Vocabulary through Synonymous, Semantically Unrelated, and Hyponym Sets on EFL Learners’ Retention

Many textbooks include semantically related words and sometimes teachers add synonyms, antonyms, etc. to the words in order to present new vocabulary items without questioning the possible effects. This study sought to investigate the effect of teaching vocabulary through synonym, semantically unrelated, and hyponym sets based on Higa’s (1963) proposed continuum. A total of 120 Iranian intermed...

متن کامل

Domain Drivers in the Modularization of FLOSS Systems

The classification of software systems into types has been achieved in the past by observing both their specifications and behavioral patterns: the SPE classification, for instance, and its further supplements and refinements, has identified the S-type (i.e., fully specified), the P-type (i.e., specified but dependent on the context) and the E-type (i.e., addressing evolving problems) among the...

متن کامل

Ontology Modularization for Knowledge Selection: Experiments and Evaluations

Problems with large monolithical ontologies in terms of reusability, scalability and maintenance have led to an increasing interest in modularization techniques for ontologies. Currently, existing work suffers from the fact that the notion of modularization is not as well understood in the context of ontologies as it is in software engineering. In this paper, we experiment on applying state-of-...

متن کامل

Connecting Business and IT - A Model-driven Web Service based Approach

Service-oriented architectures focus on the alignment of business processes and the supporting information technology. To facilitate the alignment and to decrease administration costs we propose linking of SOA services to enterprise models respectively target state organisational models by using semantically enriched conceptual modelling languages. Acting on language meta-level means in this co...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Programming Journal

دوره 2  شماره 

صفحات  -

تاریخ انتشار 2018